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SUMMARY  OF  TECHNICAL  PROGRESS 


Robotic  planning,  if  it  is  to  be  successful  in  real-world  situations,  must  find  some  way  to  side-step 
the  now-well-documented  obstacles  to  classical  AI  planning.  These  recent  results  show  that  the 
computational  complexity  of  standard  planning  is  unacceptable  even  with  drastic  and  untenable 
simplifying  assumptions  about  the  world.  The  source  of  complexity  in  real-world  robotic  domains 
includes  the  problems  of  data  uncertainty,  large  amounts  of  data  to  consider,  as  well  as  the  problem  of 
tractably  producing  plans  according  to  the  given  domain  rules.  Pretending  that  these  complexities  do 
not  exist  relegates  a  computer  system  to  a  trivialized  micro-world  with  little  hope  of  applications  to 
the  real  world. 

The  research  of  this  grant  has  been  directed  towards  dealing  with  the  real-world  constraints  that  arti¬ 
ficial  intelligence  robotics  systems  must  address.  We  have  made  significant  progress  on  two  fronts. 
The  first  investigates  an  integrated  approach  to  planning  wherein  a  classical  a  priori  planner  is  aug¬ 
mented  with  reactive  abilities.  Classical  planning  enables  the  construction  of  provably-correct 
plans,  but  requires  perfect  a  priori  information  to  do  so.  In  the  real  world,  a  perfect  characterization 
of  a  domain  is  likely  to  be  intractable,  and  thus  classical  planning  has  been  limited  to  toy  domains. 
Reactivity  provides  sensitivity  to  the  execution  environment  and  flexibility  in  planning.  However, 
being  essentially  a  hill-climbing  approach  to  planning,  it  can  neither  guarantee  eventual  nor  effi¬ 
cient  goal  achievement.  Ours  represents  an  integration  of  the  two  approaches  to  gain  their  individu¬ 
al  strengths  while  dealing  with  their  limitations.  This  integration  provides  a  classical  planner  with 
the  ability  to  defer  goals  which  it  can  prove  achievable,  and  to  then  address  these  goals  during  execu¬ 
tion.  From  reactive  planning  is  gained  the  ability  to  utilize  runtime  information,  eliminating  the  need 
for  perfect  a  priori  information.  As  an  augmented  classical  planner,  this  integrated  approach  retains 
the  goal-directedness  afforded  by  a  priori  planning.  Through  the  achievability  constraint,  this  inte¬ 
grated  approach  also  retains  the  provably-correct  nature  of  plans  constructed  in  classical  planning. 
We  have  implemented  a  prototype  system  to  aid  in  our  investigations.  Our  progress  has  been  docu¬ 
mented  in  the  technical  literature. 


I  ~  The  underlying  unification  is  performed  by  a  new  planning  technique  we  developed  called  contin- 
•<  ..  gent  explanation-based  learning.  As  in  the  second  approach,  learning  is  a  key  ingredient.  In  contin- 

g  g  ^  gent  EBL,  deferred  goals  are  represented  using  conjectured  variables,  which  act  as  placeholders  for 

iJS  2  |  the  eventual  values  of  plan  parameters  whose  values  are  unknown  prior  to  execution.  Contingent 

j  £  %  EBL  extends  traditional  EBL  by  allowing  a  planner  to  distinguish  between  decisions  made  prior  to 

'&5 1  U  q  execution  and  decisions  made  during  execution.  This  is  done  through  the  use  of  conjectured  vari- 

'q  B  |  ables,  which  act  as  placeholders  for  deferred  planning  decisions.  ContingentEBL  also  incorporates 

?  xj  I  completors  into  general  plans.  Completors  are  responsible  for  the  runtime  determination  of  values 
-  'i  |  i  to  replace  the  conjectured  variables..  Only  conjectured  variables  with  accompanying  achievability 

£  |  proofs  are  allowed  into  contingent  explanations,  and  thus  the  general  plans  learned  in  contingent 

1  EBL  are  guaranteed  to  be  computable. 
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The  second  thrust  of  this  grant  explores  a  new  approach  called  permissive  planning.  We  have  im¬ 
plemented  our  ideas  in  the  GRASPER  system  which  has  capabilities  to  monitor  execution  of  its 
plans  and  to  tune  its  model  of  the  world  on  failure  through  use  of  explicit  approximations.  We  have 
tested  GRASPER  extensively  on  two  domains:  achieving  stable  robotic  grasping  of  novel  objects, 
and  tray-tilting  to  orient  parts  (a  domain  developed  at  CMU  that  we  adopted  for  comparison  pur¬ 
poses).  Approximations,  as  employed  by  GRASPER,  involve  tunable  continuous  quantities.  These 
approximations  include  controls ,  which  directly  control  some  parameter  in  the  world,  constraints , 
which  help  to  evaluate  the  choice  of  one  from  a  set  of  candidates,  an d  weights,  which  evaluate  con¬ 
straints  with  respect  to  each  other.  The  system  is  being  employed  in  the  robotic  grasping  domain  to 
improve  its  performance  at  grasping  children’s  puzzle  pieces.  Approximations  are  tuned  on  failure 
to  yield  better  future  performance.  This  has  the  effect,  with  failures  due  to  uncertainty,  of  reducing 
failures  by  tuning  approximate  rules  so  as  to  be  more  uncertainty  tolerant. 
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DOD  INTERACTIONS 

DOD  interactions  have  centered  around  conferences  and  workshops.  Scott  Bennett,  a  graduate  stu¬ 
dent,  and  the  PI  attended  and  presented  at  the  Naval  Research  Workshop  on  Knolwedge  Acquisition 
in  November,  1989.  In  August,  the  PI  attended  the  DOD  workshop  on  Autonomous  Vehicles  in 
Crystal  City.  Several  interactions  involved  personnel  from  the  Naval  Research  Laboratory  and  our 
research  group  at  the  AAAI  and  Machine  Learning  conference,  and  the  Machine  Learning  Program 
Committee  meeting. 
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DESCRIPTION  OF  SOFTWARE/HARDWARE  PROTOTYPES 

Two  prototypes  have  resulted  from  this  research.  The  first  demonstrates  a  Machine  Learning  AI 
system  connected  to  a  scara-type  RTX  robot  arm  and  a  simple  camera!  The  system  is  presented  with 
real-world  objects  of  rather  complex  shapes.  These  were  not  manufactured  by  the  research  team  but 
are  from  a  children’s  plastic  jigsaw-like  puzzle.  The  system  has  no  prior  knowledge  of  the  piece 
shapes  or  information  on  how  to  grasp  them.  After  taking  a  picutre  and  approximating  their  shape, 
the  system  postulates  a  grasping  strategy  and  tries  it  out  with  the  robot  manipulator.  When  the  initial 
strategy  fails  (because  of  uncertain  or  approximated  information)  the  system  diagnoses  the  failure 
and,  using  the  explanation,  propagates  the  qualitative  uncertainty  from  the  real-world  to  parameters 
within  the  grasping  strategy.  A  new  strategy  is  produced  that  is  as  uncertainty-tolerant  as  possible 
for  the  diagnosed  class  of  uncertainties.  The  resulting  strategy  is  available  together  with  sufficient 
indexing  information  for  future  grasping  episodes.  The  prototype  system  has  been  run  in  several 
data-collection  experiments,  the  empirical  findings  confirm  the  theoretical  predictions  of  continu¬ 
ously  improved  grasping  success  on  novel  objects.  The  second  system  demonstrates  a  combined 
classical/reactive  planner  of  the  sort  described  earlier.  It  is  far  less  developed  but  nonetheless  dem¬ 
onstrates  a  proof  of  concept  that  reactive  operators  can  co-exist  within  standard  operators  in  a  classi¬ 
cal  planner.  An  experiment  with  the  prototype  demonstrates  that,  under  certain  circumstances,  such 
an  integrated  approach  provides  significant  improvement  over  classical  planning  alone,  while  pre¬ 
serving  the  domain-independence  advantages  of  classical  planning  over  a  purely  reactive  approach. 


